A comparative study of fairness-enhancing interventions in machine learning

نویسندگان

  • Sorelle A. Friedler
  • Carlos Eduardo Scheidegger
  • Suresh Venkatasubramanian
  • Sonam Choudhary
  • Evan P. Hamilton
  • Derek Roth
چکیده

Computers are increasingly used to make decisions that have significant impact in people’s lives. Often, these predictions can affect different population subgroups disproportionately. As a result, the issue of fairness has received much recent interest, and a number of fairness-enhanced classifiers and predictors have appeared in the literature. This paper seeks to study the following questions: how do these different techniques fundamentally compare to one another, and what accounts for the differences? Specifically, we seek to bring attention to many under-appreciated aspects of such fairness-enhancing interventions. Concretely, we present the results of an open benchmark we have developed that lets us compare a number of different algorithms under a variety of fairness measures, and a large number of existing datasets. We find that although different algorithms tend to prefer specific formulations of fairness preservations, many of these measures strongly correlate with one another. In addition, we find that fairness-preserving algorithms tend to be sensitive to fluctuations in dataset composition (simulated in our benchmark by varying training-test splits), indicating that fairness interventions might be more brittle than previously thought. ∗This work was partially supported by National Science Foundation under grants IIS1633387, IIS-1513651, and IIS-1633724, as well as by a grant from the Ethics and Governance of AI Initiative. Source code, including instructions for adding your own algorithm or dataset, can be found at: https://github.com/algofairness/fairness-comparison †[email protected][email protected] §[email protected][email protected][email protected] ∗∗[email protected] 1 ar X iv :1 80 2. 04 42 2v 1 [ st at .M L ] 1 3 Fe b 20 18

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Expanded HTA: Enhancing Fairness and Legitimacy

All societies face the need to make judgments about what interventions (both public health and personal medical) to provide to their populations under reasonable resource constraints. Their decisions should be informed by good evidence and arguments from health technology assessment (HTA). But if HTA restricts itself to evaluations of safety, efficacy, and cost-effectiveness, it risks being vie...

متن کامل

Fair Processes for Priority Setting: Putting Theory into Practice; Comment on “Expanded HTA: Enhancing Fairness and Legitimacy”

Embedding health technology assessment (HTA) in a fair process has great potential to capture societal values relevant to public reimbursement decisions on health technologies. However, the development of such processes for priority setting has largely been theoretical. In this paper, we provide further practical lead ways on how these processes can be implemented. We first present the misconce...

متن کامل

Comparative Analysis of Machine Learning Algorithms with Optimization Purposes

The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches‎. ‎Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data‎. ‎In this paper‎, ‎a methodology has been employed to opt...

متن کامل

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

A Comparative Study of SVM and RF Methods for Classification of Alteration Zones Using Remotely Sensed Data

Identification and mapping of the significant alterations are the main objectives of the exploration geochemical surveys. The field study is time-consuming and costly to produce the classified maps. Therefore, the processing of remotely sensed data, which provide timely and multi-band (multi-layer) data, can be substituted for the field study. In this study, the ASTER imagery is used for altera...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1802.04422  شماره 

صفحات  -

تاریخ انتشار 2018